Slowly Synchronizing Automata with Zero 
and Incomplete Sets 



Elena V. Pribavkina 

Ural State University, 620083 Ekaterinburg, Russia 
elena . pribavkina@usu . ru 



Abstract. Using combinatorial properties of incomplete sets in a free 
monoid we construct a series of n-state deterministic automata with zero 

2 

whose shortest synchronizing word has length 2j- + 2 _ 1, 
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1 Introduction 

Recall that a deterministic finite automaton srf = (Q, A, 6} is defined by specify- 
ing a finite state set Q, an input alphabet A, and a transition junction S : Q x A — > 
Q. The function S naturally extends to the free monoid A*; this extension is still 
denoted by <5. An automaton — (Q, A, 5} is said to be synchronizing (or reset) 
if there is a synchronizing (reset) word, that is a word w £ A* which takes all 
the states of &/ to a particular one: S(q, w) = S(q', w) for all q, q' 6 Q. 

Reset automata turn out to have various applications in different fields such 
as model-based testing of reactive systems, robotics, dna-computing, symbolic 
dynamics. In view of the applications an important question is about the length 
of the shortest reset word for a given synchronizing automaton. This issue has 
been widely studied over the past forty years, especially in connection with the 
famous Cerny conjecture [1] which states that any n-state synchronizing automa- 
ton possesses a synchronizing word of length at most (n — l) 2 . This conjecture 
has been proved for a large number of classes of synchronizing automata, nev- 
ertheless in general it remains one of the most longstanding open problems in 
automata theory. For more details see the surveys [5,9,10]. It is known (see for 
example [11, Proposition 3]) that the proof of the Cerny's conjecture reduces to 
proving it in two particular cases: for reset automata whose underlying graph is 
strongly-connected, i. e. each state is reachable from any other one, and for reset 
automata with zero, i. e. with a particular state such that <5(0, a) — for any 
a G A. Thus obtaining bounds for the maximal length of shortest synchronizing 
words for the class of reset automata with zero is a rather natural and interesting 
problem. 

It is clear that any synchronizing automaton with zero possesses a unique 
zero state, and any synchronizing word brings the automaton in the zero state. 
A rather simple argument shows that the length of a synchronizing word for 



a given n-state reset automaton with zero is at most 



i(n-l) 



see e. g. 



This 



bound is tight, since for each n there is an n-state reset automaton with zero 



and n — 1 input letters whose shortest reset word has length 



n(n— 1) 



(Fig. ED. 



A\{a 1 ,a 2 } A\{a 2 ,a 3 } J 4\{a 3 ,a 4 } A \ {a n _ 2 , a«-i} A\{a„_i} 





Fig. 1. An n-state reset automaton with zero over an n 
whose shortest reset word is of length "^" 2 1 - ) . 



1-lettered alphabet 



An essential feature of the example in Fig. [T] is that the input alphabet 
size grows with the number of states. This contrasts with the aforementioned 
examples due to Cerny [1] in which the alphabet does not depend on the state 
number. Thus a natural question is to determine the maximum length c m (n) of 
shortest reset words for n-state synchronizing automata with zero over a fixed 
m-lettered input alphabet as a function of n. 

Recently in [4] with the help of computer experiments P. Martjugin found a 
series of n-state automata with zero over a binary alphabet whose shortest reset 
words have length ^- + o(n 2 ). The main result of [4] can be stated as follows: 

Theorem 1. For each integer n > 8 there exists a synchronizing n-state au- 
tomaton with zero over a binary alphabet whose shortest synchronizing word has 
length 



;/~+6n — 16 
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Note that the construction from [4] is not trivial and beside that, it can 
not be extended on larger alphabets. Let us explain what we mean by such an 
extension. We say that a synchronizing automaton si = (Q, A, 5} is proper, if 
each letter of the alphabet A appears in each word synchronizing this automaton. 
Putting this another way, each letter is essential for synchronization of such an 
automaton. Naturally, it is the class of proper automata for which the problem 
of estimation of the function c m (n) should be considered, but adding new letters 
to the automata from Theorem [1] violates this property. 

The main result of the present paper is the following 

Theorem 2. Let A be an alphabet with \ A\ > 2. For any integer k > \A\ there is 
a proper synchronizing automaton with zero and n = 2k states over the alphabet 
A whose shortest synchronizing word has length 



n n 

T + 2- L 



Our construction leads to the same growth rate of the length of the shortest 
reset word as the construction from [4] , but essentially differs from it since there 
are no limitations on the number of input letters. Another important feature is 
that is was found not by a brute force, but as a result of analysis of interrelations 
between synchronizing automata with zero and incomplete sets in free monoids. 
Recall that complete and incomplete sets play a significant role in combinatorics 
on words and theory of codes in connection with the notion of a maximal code 
(see e.g. [7]). We think that the relation between combinatorial objects of dif- 
ferent nature (codes and automata) that we establish in the present paper is 
of a self-dependent interest. Recently such a relation has been independently 
discovered by Rampersad and Shallit in [6] where the computational complexity 
of some universality problems is studied. In particular the authors obtained a 
result similar to our Proposition [3] and a partial result of our Proposition [21 

The paper is organized as follows. In Section Owe introduce the notion of 
an incomplete set and study properties of incompletable words necessary for the 
proof of the main result. In Section [3] from a finite set of words A we build 
an automaton J^(A) with zero, recognizing the monoid X*. In Section 3] we 
establish the equality between words incompletable in X*, and words synchro- 
nizing the constructed automaton jF(A), which is used for the proof of the main 
Theorem EI 

2 Incomplete Sets 

To fix the notation let us recall the main definitions from combinatorics on words. 
By \w\ we denote the length of the word w, the length of the empty word A is 
equal to zero: |A| =0. By A + we denote the set of all non-empty words over 
the alphabet A, and by A k - the set of all words of length k over A. A word 
u G A + is a factor of w (prefix or suffix respectively), if w can be decomposed as 
w = xuy (w — uy or w — xu respectively) for some x,y A* . A factor (prefix, 
suffix) u of w is called proper if u ^ w. Given a word u = a\a2 ■ ■ ■ a n G A + 
by u[i . . . j] with 1 < i, j < n we denote the factor aia\+\ • • ■ a 3 if i < j, and 
the empty word if i > j. Moreover, we put u[0] — A. A word u G A* is called 
unbordered if none of its proper prefixes is its suffix. 

Let A be a finite set of words over the alphabet A. A word w G A* is said to 
be completable in X* if w is a factor of some word in X* (to put this another 
way, w can be "covered" by elements from X), otherwise w is incompletable. We 
say that the set X is complete if any word over A is completable in A*, otherwise 
the set X is said to be incomplete. Some properties of compete and incomplete 
sets were studied in [7] . Here we use the following result from [7] : 

Proposition 1. Let X C A*, k = maxlxl, and there is a word u of length k 

such that no element of X is a factor of u. Then the set X is incomplete, and 
the word w — (wa) fe " 1 u for an arbitrary letter a is incompletable. 

Next we consider the sets of the form A = A k \{u}, where u is some unbor- 
dered word and k > 2. The Proposition [T] implies the following 



Corollary 1. Any set of the form X = A k \ {u} with k > 2 and u G A k is 
incomplete. 

Lemma 1. Let X = A k \ {u} for some unbordered word word u. Any incom- 
putable word in X* is of the form v n uviu ■ ■ ■ uv m uv m+ i with Vi G A* \ A*uA* , 
m > 0. 

Proof. Let w be an incompletable word in X* , then u appears as a factor in 
w. Indeed, if it is not the case, consider the shortest word z G A* such that 
the length of the word wz is a multiple of k. Then wz can be decomposed as 
wz = xix 2 ■ • ■ x s with X\, X2, ■ ■ ■ , x s G A k (Fig. 2). Since w does not contain u 
as a factor for all i we have Xi ^ u, hence x\, x 2l ■ ■ ■ , x s G A k \ {u} and the word 
w is completable in X* , which is a contradiction. 



Suppose now that u appears just once as a factor of w, i.e. w = w'uw" and 
w', w" G A* \ A*uA* . Then the word u[2 . . . k]w" does not contain u as a factor 
since u is unbordered. From the previous argument we deduce that there is a 
word z G A* such that u[2...k]w"z G X*. In the same way, w'u[l] does not 
contain a as a factor, hence there is a word y G A* such that yw'u[V\ G X* 
(Fig. 3). Thus ywz G X*, i. e. also in this case w is completable in X*. We come 
to a contradiction. 



Therefore, the factor u occurs in w at least twice. Moreover, since u is 
unbordered these occurrences do not overlap, and w can be represented as 
w = vquviu ■ ■ ■ v m uv m+ i, where no Vi contains u as a factor, and m > 0. 

Next we give a necessary and sufficient condition for a word of the form 
w = v uviu- ■ -VmUVm+i to be incompletable. To this end we introduce some 
auxiliary notions. 

A position < i < k in the j-th occurrence of the word u in w is called 
forbidden if u[i + 1 . . . k]vju ■ ■ ■ v m uv m+ iz does not belong to X* for any z G A*. 
For the j-th occurrence of the word u in w, where j ranges from 1 to m, by Sj 
we denote the set of forbidden positions. Note that Sj C {0, 1, . . . k — 1} for all 
j's. A simple observation is the following 




Fig. 2. The word w does not contain u as a factor. 




Fig. 3. The word w contains it as a factor only once. 



Remark 1. A word w — vqiiviu ■ ■ ■ v m uv m +\ is incompletable in X* if and only 
ifSi = {0,l,...,*-l}. 

A set S = {s\, S2 ■ ■ ■ , s m } C N is called k -representative if among the residues 
modulo k of its elements there are all possible positive residues modulo k: 

{1, 2, . . . , k — 1} C {s\ mod k, S2 mod k, . . . , s m mod k}. 

Now we are ready to proof the criterion for a word w to be incompletable. 

Lemma 2. Let X = A k \ {u} for some unbordered word u. A word w = 
VQUV1UV2 ■ ■ ■ uv m uv m +i with Vi S A* \ A*uA*, m > is incompletable in X* 
if and only if the set 

{\vi\, \vi\ + \v 2 \,..., \v\\ + \V2\-\ h|f m |} 

is k-representative. 

Proof. Consider a word w = vquv\uv2 ■ ■ • uv rn uv m+ i. Since the set X contains 
only words of length k, for all j we have £ Sj. Moreover, since no proper prefix 
of u is its suffix we have 

Srn+l = {0}. 

Further, it is easy to see that if the set Sj is already defined, then a non-zero 
position i belongs to if and only if 

u[i + l...k]vj-xu[l...e\ e X*, ieSj. 

Therefore 0^i£ S m if and only if u[i + 1 . . . k]v m S X*, i. e. the length of the 
factor u[i + 1 . . . k]v m is divisible by k, thus i = \v m \ mod k. We get 

S m = {0, \v m \ mod A;}. 

Next suppose we have already calculated 

Sj = {0, \vj | mod k, (\vj \ + \vj+± |) mod fc, . . . , (\vj ! + ■•■ + \v m \) mod fc} (1) 

for < j < m. Let us prove that 

Sj-i = {0, \vj-i\modk, + modfc, . . . , , . 

(|^-i| + |^| + --- + |^m|)modfc}. 1 ' 

Let < i < k. Since X — A k \ {u}, and u is unbordered, there is an integer I 
such that < I < k and u[i + 1 . . . k]vj-iu[l .. .£] S A*, i. e. the length of this 
factor is divisible by k, hence 

i = (\vj—i | + £) mod k. 

Since ^ i S 5j_i if and only if £ G 5j, using ([T]) we obtain @. 
Therefore, 

S*! = {0, |ui| mod k, (\vi \ + \v 2 \) mod k, . . . , (\vi \ + ■ ■ ■ + \v m \) mod A:}. 
Taking into account Remark [1] we conclude the proof. 



Proposition 2. The shortest incompletable word for the set X = A k \ {u], 
where u is an unbordered word, has length k 2 + k — 1 . 

Proof. Proposition [T] (or Lemma [2]) implies that the word w — (ua) k ~ 1 u is 
incomletable in X* for any a £ A. Such a word has length k 2 + k — 1. 

Let w' be the shortest incompletable word for the set X* . Then by Lemma[T] 
it has the form vquv± . . . v m uv m +i with Vi G A* \ A*uA* and m > 0. Note 
that m > k — 1 and among Vi's (1 < i < m) there should be at least k — 1 
non-empty words (otherwise the set {|«i|, |t>i| + |u2 1 , ■ ■ ■ , \vi\ + • • ■ + \v m \} is 
not fc-representative, and by Lemma [2] the word w can be completed). Thus, 
\w'\ > k- 1 + k 2 = \w\. 

3 Construction of the Automaton from the set X 

In this Section we consider finite automata as devices for recognizing languages. 
Recall that to this end we choose an initial state Qq and a set F of terminal states. 
An automaton si = (Q, A, 5, Qq, F) is said to recognize a language L C A* if 



Let X be a finite set of words. By &{X) we denote an automaton recognizing 
the monoid X* , in which the initial state 1 is also the only terminal one, all the 
cycles pass through this state, and moreover, all the words from X label all possi- 
ble simple cycles. Such automata are known as semi-flower automata, see e. g. [2]. 
An example of a semi-flower automaton for the set X = {aa, ab, ba, bb, aab} is 
presented on Fig. 4. 



Fig. 4. The semi-flower automaton for X* , X = {aa, ab, ba, bb, aab}. 

In general a semi-flower automaton is nondeterministic. Recall that for such 
an automaton the deterministic transition function 6 : Q x A — > Q is replaced 
with the function S : Q x A — > 2^. Note that this definition does not exclude the 
possibility S(q, a) = for some q G Q and a G A (for instance, for the automaton 
in Fig. 4 we have 6(3, a) = 0). This function as in the case of deterministic 
automata, can be extended to the free monoid A*: if w = au, a G A, u G A + 
then we put 



L = {w G A* | 6(q ,w) G F}. 





Any semi-flower automaton can be completed by adding a new state and 
putting all previously undefined transitions to be equal (Fig. 5). Moreover, 
put 5(0, a) = for all a G A. The automaton obtained in this way is denoted by 




Fig. 5. The automaton ^(X) for X = {aa, ab, ba, bb, aab}. 

In particular case when X = A k \ {u}, the automaton &{X) will be denoted 
by &(h,u). 

4 Incomplete Sets and Synchronizing Automata 

The notion of synchronization of an automaton can be extended for the case of 
nondeterministic automata in different ways. Here we use one of such extensions 
which is known as strong synchronization [3]. 

A nondeterministic finite automaton &/ = (Q, A, 5} is said to be synchroniz- 
ing if there is a word w G A* and a state q G Q such that S(q', w) = {q} for any 
q' G Q. Putting this another way, all possible paths labeled by the word w from 
an arbitrary state of srf lead to the particular state q. 

The following proposition connects the notions of a word incompletablc in 
X* and a synchronizing word for the completed semi-flower automaton &{X). 

Proposition 3. Given an incomplete set X, the word w is incompletable in X* 
if and only if w is synchronizing for the automaton &{X). 

Proof. By the definition of &{X) the fact that the word w is not a factor of any 
word in A* means that it can not be read from any state of this automaton. 
Equivalcntly, this word brings to the state any state of the automaton &(X). 

Now we construct the automaton JP(k,u) = (Q, A, S, 1, {1}) for k > 2 and 
the unbordered word u = a\a2 ■ ■ ■ au- 

This automaton has n = 2k states Q = {0, 1, . . . , 2k — 1}, and the transitions 
are defined as follows (see Fig. 6). 

For 1 < i < k — lwe put 

5(i,a,i) = i + 1; 



5(i, b) = k + i for all b 6 A \ {a,}. 
6(k, at) = 0; 

S(k,b) = 1 for all b e A\{a k }. 
For k + 1 < i < 2k - 2 we put 
6(i, a) = i + 1 for all nei; 
5(2k - 1, a) = 1 for all a G A; 

5(0, a) = for all a e A. 

A\a k 




A 



Fig. 6. The automaton J^(fc, it) for k > 2 and it = ai<22 • • • dfc. 

Note that in this case the automaton ^(k,u) is deterministic; moreover by 
proposition [3] the word w = (ua) k ~ 1 u is its shortest reset word. Therefore it 
holds the following 

Proposition 4. Given an unbordered word u of length k > 2 and X = A k \ {u}, 
the shortest synchronizing word for the deterministic automaton JP(k,u) with 
zero has length k 2 + k — 1 . 

Proposition 5. Let k > \A\, and let u be an unbordered word of length k con- 
taining all the letters of the alphabet A. Then the automaton J^"(fc, u) is proper. 

Proof. Lemma [T] implies that the word u is a factor of any incompletable word 
for A k \ {u}, hence by Proposition [3] u is a factor of any reset word for the 
automaton J?(k, u). Since every letter of the alphabet A occurs in u, then every 
letter occurs in each reset word for ^(fc,w), thus by definition this automaton 
is proper. 



Propositions Q] and [5] imply the main theorem [5] 
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